Learning by Experience and by Imitation in Multi-Robot Systems

نویسندگان

  • Dennis Barrios-Aranibar
  • Luiz M. G. Gonçalves
  • Pablo Javier Alsina
چکیده

With the increasing number of robots in industrial environments, scientists/technologists were often faced with issues on cooperation, coordination and collaboration among different robots and their self governance in the work space. This has led to the development of systems with several cooperative robotic agents. (Kim et al., 1997b). Generally, a system with several robotic agents (multi-robot system) is composed by two or more robots executing a task in a cooperative way (Arai and Ota, 1992). Coordination, collaboration and cooperation are three terms used without distinction when working with multi-agent and multi-robot systems. In this work, we adopt a definition proposed by Noreils (Noreils, 1993) in which cooperation occurs when several agents (or robots) are gathered together so as to perform a global task. Coordination and collaboration are two forms of cooperation (Botelho and Alami, 2000). Coordination occurs when an entity coordinates its activity with another, or it synchronizes its action with respect to the other entity, by exchanging information, signals, etc. And, collaboration occurs when two or more agents decompose a global task in subtasks to be performed by each specific agent. Generally, the solution for problems using multi-agent and multi-robot systems is divided into stages. When talking about autonomous robots, two of these stages are the task allocation stage and the task execution stage. Task allocation should be done so that all components (agents or robots) in the system are used and the problem is completely solved. The task execution stage itself should be performed so that the agents do not interfere to each other (coordination and/or collaboration) when solving the problem. Traditionally, both stages are carried out independently. In the task allocation stage, it is defined if the agents will collaborate to each other or if they will coordinate their activities. In the task execution stage, collaboration and/or coordination are effectively done. In the literature, each stage is implemented using different techniques. The task allocation stage can be implemented using centralized or decentralized approaches (Le Pape, 1990). Centralized approaches can be implemented as an optimization problem. Decentralized approaches generally use marked based approaches like the contract-net protocol (CNP) (Ulam et al., 2007) or other approaches derived from it (Botelho and Alami, 1999). The task execution stage can be implemented in many ways. It depends of the nature of interactions between agents (Collaboration or coordination) and if agents can modify or not their strategies (Static and dynamic strategies). For example it can be implemented using

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Embodied imitation-enhanced reinforcement learning in multi-agent systems

Imitation is an example of social learning in which an individual observes and copies another’s actions. This paper presents a new method for using imitation as a way of enhancing the learning speed of individual agents that employ a well-known reinforcement learning algorithm, namely Q-learning. Compared to other research that uses imitation with reinforcement learning, our method uses imitati...

متن کامل

Learning Strategies for Coordination of Multi Robot Systems: a Robot Soccer Application

This paper presents a hybrid method for learning a dynamic strategy for a robot soccer team. In this method, an imitation learning scheme based on observed robot soccer games is used as a seed for an experience-guided learning scheme based on reinforcement learning. A lack in the application of classic reinforcement learning to the robot soccer problem is the high number of states to be analyze...

متن کامل

An Unsupervised Learning Method for an Attacker Agent in Robot Soccer Competitions Based on the Kohonen Neural Network

RoboCup competition as a great test-bed, has turned to a worldwide popular domains in recent years. The main object of such competitions is to deal with complex behavior of systems whichconsist of multiple autonomous agents. The rich experience of human soccer player can be used as a valuable reference for a robot soccer player. However, because of the differences between real and simulated soc...

متن کامل

Vision-Based Imitation Learning in Heterogeneous Multi-Robot Systems: Varying Physiology and Skill

Imitation learning enables a learner to improve its abilities by observing others. Most robotic imitation learning systems only learn from demonstrators that are similar physically and in terms of skill level. In order to employ imitation learning in a heterogeneous multi-agent environment, we must consider both differences in skill, and physical differences (physiology, size). This paper descr...

متن کامل

Situated robot learning for multi-modal instruction and imitation of grasping

A key prerequisite to make user instruction of work tasks by interactive demonstration effective and convenient is situated multi-modal interaction aiming at an enhancement of robot learning beyond simple low-level skill acquisition. We report the status of the Bielefeld GRAVIS-robot system that combines visual attention and gestural instruction with an intelligent interface for speech recognit...

متن کامل

Cost-Based Policy Mapping for Imitation

Imitation represents a powerful approach for programming and autonomous learning in robot and computer systems. An important aspect of imitation is the mapping of observations to an executable control strategy. This is particularly important if the behavioral capabilities of the observed and imitating agent differ significantly. This paper presents an approach that addresses this problem by loc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012